Query Steering for Interactive Data Exploration
نویسندگان
چکیده
Traditional DBSMs are suited for applications in which the structure, meaning and contents of the database, as well as the questions to be asked are already well understood. There is, however, a class of applications that we will collectively refer to as Interactive Data Exploration (IDE) applications, in which this is not the case. IDE is a key ingredient of a diverse set of discovery-oriented applications we are dealing with, including ones from scientific computing, financial analysis, evidence-based medicine, and genomics. The need for effective IDE will only increase as data are being collected at an unprecedented rate. IDE is fundamentally a multi-step, non-linear process with imprecise end-goals. For example, data-driven scientific discovery through IDE often requires non-expert users to iteratively interact with the system to make sense of and to identify interesting patterns and relationships in large, amorphous data sets. To make the most of the increasingly available complex and big data sets, users would need an “expert assistant” who would be able to effectively and efficiently guide them through the data space. Having a human assistant is not only expensive but also unrealistic. Thus, it is essential that we automate this task. We propose database systems be augmented with an automated “database navigator” (DBNav) service that assists as a “tour guide” to facilitate IDE. Just like a car navigation system that offers advice on the routes to be taken and display points of interest, DBNav would similarly steer the user towards interesting “trajectories” through the data, while highlighting relevant features. Like any good tour guide, DBNav should consider many kinds of information; in particular, it should be sensitive to a user’s goals and interests, as well as common navigation patterns that applications exhibit. We sketch a general data navigation framework and discuss some specific components and approaches that we believe belong to any such system.
منابع مشابه
AIDE: An Automated Sample-based Approach for Interactive Data Exploration
In this paper, we argue that database systems be augmented with an automated data exploration service that methodically steers users through the data in a meaningful way. Such an automated system is crucial for deriving insights from complex datasets found in many big data applications such as scientific and healthcare applications as well as for reducing the human effort of data exploration. T...
متن کاملComputational Steering
Computational steering is the online management of the execution of an application and its resources for the purpose of either performance improvement or application exploration. Generally, visualizations are used to provide the user with insight into the state and behavior of the underlying system, and as a feedback mechanism, enabling users to gauge the effectiveness of these parameter adjust...
متن کاملGraphVista: Interactive Exploration Of Large Graphs
The potential to gain business insights from graph-structured data through graph analytics is increasingly attracting companies from a variety of industries, ranging from web companies to traditional enterprise businesses. To analyze a graph, a user often executes isolated graph queries using a dedicated interface—a procedural graph programming interface or a declarative graph query language. T...
متن کاملA Progressive Query Materialization for Interactive Data Exploration
Data analysis is task of competently digging out data insights even if user is uncertain but database systems unable to meet. User’s Iterative interactions with system might be an alternative. Interactive data exploration (IDE) is one such system, supports exploration by incorporating user intention. IDE is key ingredient of many discovery applications, such as scientific computing, financial a...
متن کاملA Compilation Technique for Interactive Ontology-mediated Data Exploration
We make a step towards the interactive exploration of data in the ontology-mediated setting, presenting a technique to construct an offline compilation that allows us to answer efficiently different related queries in online phase, without the need to access again the original data. We also propose algorithms to construct relevant variations of a given query that, for example, reduce or increas...
متن کاملImmersive Visualization Using AVS/Express
To achieve benefit and value from high performance computing (HPC), visualization is employed for a direct understanding of computational results. The more closely coupled the visualization system can be to the HPC data source, the better the chance of fostering discovery and insight. A software architecture is described, with discussion of the dynamic object manager allowing scalable high-perf...
متن کامل